超越 Transformer Architecture? Inception Launches the World's First Inference Large Model Based on Diffusion Models - Mercury 2
Inception Labs has released the Mercury2 model, which uses diffusion models instead of the Transformer architecture, achieving a paradigm shift in text generation. This model no longer generates text character by character but processes it as a whole, like editing, aiming to break through the performance bottlenecks of traditional large models.